The Capacity of the 
Quantum Multiple Access Channel 



Andreas Winter 



O 
O 

(N 



> 

ON 

o 
o 

oo 

On 
Oh 

cr 



X 



Abstract — We define classical— quantum multiway channels 
for transmission of classical information, after recent work 
by Allahverdyan and Saakian. Bounds on the capacity re- 
gion are derived in a uniform way, which are analogous to 
the classically known ones, simply replacing Shannon en- 
tropy with von Neumann entropy. For the single receiver 
case (multiple access channel) the exact capacity region is 
determined. These results are applied to the case of noisy 
channels, with arbitrary input signal states. 

A second issue of this work is the presentation of a cal- 
culus of quantum information quantities, based on the alge- 
braic formulation of quantum theory. 

Keywords — quantum channel, multiway channel, coding, 
capacity. 



I. Introduction 

Classical multiway channels were already studied by 
Shannon Jj"6[ |. Ahlswede |Q, Q first determined the ca- 
pacity region of the channel with s senders and r receivers, 
where all senders want to transmit independent messages, 
which all receivers should get. For a good overview on mul- 
tiuser communication theory in general consult [||, or the 
textbook @. 

In the present paper we define the corresponding quan- 
tum channel (after Allahverdyan and Saakian jj|), extend- 
ing the definition of a classical-quantum channel (see H). 
Our motivation is twofold: in the first place, it is a very 
common situation that many users want to communicate 
via the same transmission system, and all real systems 
should be described by quantum mechanics. Then, sec- 
ondly, we feel that it helps understanding quantum com- 
munication if we try to solve questions known in a clas- 
sical context for quantum channels. This the more, as for 
Holevo's results on quantum channels (coding theorem and 
information bound) not only the question and its answer, 
but even the method of solution is rather close to classically 
well known mathematics (see |ll| ) , and we should find out 
whether this similarity extends further. 

The results of the present work are: we bound the ca- 
pacity region, the actual bounds being obtainable from the 
classical case by formally replacing Shannon entropy by 
von Neumann entropy in the expressions, thus following a 
general principle or feeling in physics. The central result 
is a proof of the direct coding theorem for the multiple ac- 
cess channel (one receiver: r = 1), using the technique of 
Holevo || and Schumacher/ Westmoreland |l5[ , which was 
designed to solve the single-sender case. 

The author is with Fakultat fur Mathematik, Universitat Biele- 
feld, Postfach 100131, 33501 Bielefeld, Germany. Electronic address: 
winter@mathematik.unl-bielefeld.de. Research supported by the 
Deutsche Forschungsgemeinschaft via SFB 343 "Diskrete Strukturen 
in der Mathematik" . 



The outline of the paper is as follows: in section |lj the 
basic definitions are stated, in particular quantum multi- 
way channels are formally introduced. Section |l| reviews 
notation and facts about quantum information quantities 
we shall need. In the following section IV we prove the 
outer bounds for the capacity region. Sections |y| contains 
a central result on the state disturbance o f a measurement 
with high success probability. In section [vi] this result is 
used to prove the direct coding theorem for the quantum 
multiple access channel. In the last section VII we com- 
ment on the quantum-quantum multiway channel which 
may be fed with arbitrary input states. 

The results of the present work are part of the au- 
thor's Ph.D. thesis pH , mainly chapter III (with alterna- 
tive proofs), and appendix A. 

II. Quantum Multiway Channels 

This is the simplest situation of multi-user communica- 
tion in general: consider s independent senders, sender i 
using a (finite) alphabet Xi , say with an a priori probabil- 
ity distribution Pi. This alphabet serves as a set of tags 
of different actions each user may take, such that a signal 
appears in the output system, composed of the effects of 
these s independent choices, and the channel noise. To the 
output system the r receivers have partial access, and their 
task is to each reconstruct the s messages the senders chose 
to send. This is to be achieved by block-coding and via a 
previously agreed coding/decoding scheme. 

Formally, this model is captured as follows: the channel 
is simply a map 



W : X x x 



x X„ 



6(2J) 



from the input alphabets into the set 6(2)) of states of 
the (finite dimensional) C*-algebra 2), mapping the input 
(x%, . . . ,x s ) to the output state W Xl ... Xs . Without loss of 
generality we may assume that 2) = 2(TC) is the full oper- 
ator algebra of the finite dimensional Hilbert space 7i, and 
we shall assume that the W Xl x are density operators on 

However, to express the theory in this algebraic manner 
has its merits, as we shall see: 

The output state (generally mixed) is accessed by sev- 
eral receicers. These are represented by commuting *— 



subalgebras 2Jj (j = 1, 



") of 2): the meaning is that 



receiver j may use any measurement (POVM) whose ele- 
ments belong to 2) j . The commutativity ensures that these 
measurements can be performed together. The typical case 

1 ln the general case we may make use of the fact that the states of 
2) are uniquely described by density operators inside 2J. 
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of this situation is that 2) = 2Ji <g> • • • <X) 2)r, where we 
identify 2Jj with the subalgebra l®^- 1 ) ® 2)j ® of 
2}. By linear extension we may view as a completely 
positive, trace preserving map from Xi <g> ■ ■ ■ ® X s to 2): 
where X, = CAf, is the commutative algebra of C~valued 
functions on Xi, whose elements we identify with their in- 
dicator functions.^ If all the W Xl ... Xs commute with each 
other (hence have a common diagonalization) the channel 
is called quasi-classical, and classical if 2) is a commutative 
algebra. 

It should be stressed that all this can be embedded into 
standard quantum theory by identifying all algebras in 
question with operator algebras in some sufficiently large 
Hilbert space, e.g. a commutative algebra with a set of di- 
agonal matrices. To get more familiar with this formalism 
the reader might consult a book like . 

For fixed a priori distributions define the channel state 

7= X! F ii x i) ■ ■ ■ Ps(x s )xi <E> ■ ■ ■ <E) x s <E)W Xi ... Xb 

Vi XiGXi 

on Xi ® • • • ® X s ® 2). This serves as the quantum analogue 
of the joint distribution of the random variables represent- 
ing input and output letters in the classical case. It may 
be interpreted as the joint state of the system after the 
channel usage, where the senders kept a record of their 
individual letters (this is possible because they input clas- 
sical information, reflected in the classical nature of their 
systems). 

Note the 1-1-correspondence between states 7 on Xi <£> 
• • • ® X s ® 2J and pairs (P, W) of channels W and probabil- 
ity distributions P on X\ x • • • x X s . This is a feature of our 
model, which relies on the commutativity of the Xi (com- 
pare |l3| for the difficulties encountered in more general 
situations) . 

We will employ block coding on the discrete memoryless 
channel generated by W: for sequences x™, . . . ,x™, ar™ = 
xn . . . Xi n £ X- 1 , the n-block channel 

W n : X? x ••• x X* -» 6(2J®") 

is defined by 

w x ^... xns =w XllX21 ... Xsl ®w Xl2X22 ... Xs2 ®- ■ -®W XlnX2n ... X!!n . 

We now introduce some notation to describe the channel 
as seen by a subset of senders, while the others enter only 
stochastically: 

For a J C [s] = {1, . . . , s} denote Pj = ® ieJ Pi, i.e. 
Pj{x l \i G J) = Y[ l£j Pi{xi), and X(J) = U. ieJ Xi (simi- 
larly X(J) = igJ X^. 

Define the reduced channel PjoW : A'(J) — > 6(2)) by 

(iV»W)( X4 | <6J) = ]T Pj.(zi|ie J c )W Xx ... Xs . 

2 Since all algebras here are finite dimensional we do not care about 
the topological distictions necessary in general, between linear spaces 
and their duals, between maps and their adjoints. 



(Here J c denotes the complement [s] \ J of J in [s] ) . Note 
that 

Tr x(JC)1 = p A*i\*e-0®*M p J°W)lx t \ieJ)- 
VieJ-. xieXt ie.T 

Transmission is now by using codes on n-blocks: 

An n-block-code is a collection (/1, ... , f s , D\, . . . , D r ) 
of maps fi : M.i — * A?" (where Mi is the set of messages of 
sender i) and decoding observables (POVMs) Dj C 2J® ra , 
indexed by Mi x • • • x A^ s , i.e.: 

-Oj = { D m e ?)j : A* e Xi x • • ■ x 7W S }, 

such that 

There are r (average) error probabilities of the code, the 
probability that the receiver j guesses incorrectly any one 
of the sent words, taken over the uniform distribution on 
the message sets: 

ej(fi,... ,f s ,Dj) = 

1_ m 77 - \m s \^J t 



We call (/1, . . . ,f s ,Di,.., , D r ) an (n, e)-code if all error 
probabilities e.j{fi, ■ ■ ■ ,f s ,Dj) do not exceed e. 

The rates of the code are the R4 = ^ log \M%\. A tuple 
■•■ ,R S ) is said to be achievable, if for any e,5 > 
there exists for any large enough n an (n, e)-code with i— 
th rate at least Rt — 6. The set of all achievable tuples 
(which is clearly closed) is called the capacity region of the 
channel, and to determine this region is the problem to be 
addressed here. 

Some observations should be made: first, the capac- 
ity region is convex, by the time sharing principle: let 
(Ri, ... ,R S ) and (R[, ... ,R' S ) be rate tuples of m- and 
n-block codes, respectively, with error probability e each. 
By concatenating the codewords to (m + n)-blocks, and 
tensoring the corresponding decoding observables, we get 
an (m + n)-block code with error probability at most 2e, 
and with rates ^R t + ^flj. 

Second, note that in the multi-user situation not a sin- 
gle number describes the performance of the channel (as 
with capacity in the single-sender case). Instead, only with 
given behaviour of the other senders the channel gets a 
specific capacity for a particular sender. Intuitively, this 
is because the others' (unknown!) actions may be seen as 
additional noise (a phenomenon known as "interference" in 
classical multi-user channels). 

III. Information Quantities in Quantum Systems 

In this section we introduce some notation in which we 
express our results. From fl7|| , appendix A, we use the 
definitions of various information quantities for observables 
and *-subalgebras, which we review for the sake of self- 
containedness: 
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Let 21 be a C*-algebra, and p a state on it. For a *- 
subalgebra 25 we want to define the entropy of p with re- 
spect to this subalgebra (we shall stress the dependence on 
25, as p is supposed to be fixed). To this end let us consider 
the restriction a — p\<s of p to 25, and define 

77(03) = if p (93) =-Tr(<7log(r). 

Here Tr is the unique trace on 25 (i.e. a positive C-linear 
functional on 25, with the properties Tr AB = TrBA and 
Tr A* = TrA), that assigns 1 to all minimal idempotents 
of 25. An important example is the usual trace of £(7Y), 
in which case the formula gives the familiar von Neumann 
entropy of the state. 

Motivated by identities for classical Shannon entropy 
we may now define, for (elementwise) commuting *— 
subalgebras 25 and C the conditional entropy 

H(<B\£) =#(»£)-#(£), 

and the mutual information 

7(25 AC) = 77(25) + 77(C) - 77(25C) 
= 77(25) - 77(25|C). 

The condition that the algebras commute is crucial here: 
it ensures that all observables in 25 are coexistent with all 
observables in £, and also, that the product 25£ is indeed 
the algebra generated by 25 and C Of course, these defi- 
nitions are only formally derived from well known classical 
formulas, and there is no reason to expect that they are 
meaningful (which indeed they are onlyto a limited de- 
gree: see the discussions in and in U]). Anyhow, for 
our purposes they make sufficient sense. 

If X) is a third *-subalgebra, commuting with both 25 and 
C we may finally define the conditional mutual information 

7(25 A £]£>) = 77(2512)) + 77(£|D) - 77(25£|S) 

= 77(251)) + 77 (CD) - 77(25CD) - 77(£>). 

We note, that the conditional mutual information is posi- 
tive, by the strong subadditivity of von Neumann entropy 
(see @, theorem A.9). 

In all the above expressions we supressed the dependence 
on the underlying state p. In cases of possible ambiguity it 
is added as a subscript. 

With these definitions we have the (easily checked) iden- 
tities for the system introduced in section y, with the chan- 
nel state 7: 

7(X(J) A 2J|X(J C )) = 7(X( J) A 2JX(J C )) 

= 77(2J|X(J c ))-77(2J|Xi---X s ) 
= H(PjW\Pjo) - H[W\P [S] ), 

where in the last line an alternative notation is used: 

For a channel V : A — > ©(3) and a probability distribu- 
tion Q on A let 

H(V\Q)= J2Q(a)H(V a ), 



with the von Neumann entropy 77: so this is the familiar 
writing of a conditional as an average of entropies. 

There are a number of important relations between all 
these quantities, of which we shall make use of two: 

Lemma 1: Let Vk '■ At — > <5(3fc) (k = 1, 2) be two chan- 
nels, and Q a probability distribution on Ai x Ai- Forming 
the channel state 

7= ^2 Q(ai,a 2 )ai <gs Vi >ai ® a 2 O F 2 , a3 , 

a h EA h :k=l,2 

we have the following subadditivity relation: 

7(C4iC4 2 A 3i3 2 ) < I(CAi A 3i) + 7(C4 2 A 3 2 ). 
Proof: This is well known for classical channels, and 
the proof in our case runs exactly the same. Compare fl7|| , 
theorem A. 17. ■ 
Lemma 2 (Fano inequality) Let X, 2) be commuting al- 
gebras, and X be commutative. For a state p on X2) con- 
sider POVMs X C X, Y C 2)> running over the same index 
set. 

Then the probability of the event "A 7^ Y" , i.e. 

3 

satisfies 

ff(X|2J) < l + P e logTrl£. 
Proof: See fl7f| , corollary A. 25. Observe that the 
statement of the lemma is a way of expressing the Holevo 
bound 0. ■ 

IV. Upper Capacity Bounds 

The following theorem (which we call the weak converse 
because of theorem and note [|) was, in the case r = 1 
and s = 2, stated in ||. 

Theorem 3 (Weak converse) The capacity region of the 
quantum multiway channel is contained in the closure of 
all nonnegative (7?i, ... , R s ) satisfying for all J C [s] and 
3 e [r] 

R(J) = Ri < 9 « 7 -r« ( X ( J ) A W( JC )) 

ie J u 

for channel states 7„ and q u > 0, J2 U 9« = 

Proof: Let (/1,... ,f s ,Di,... ,D r ) be any (n,e)- 
code with rate tuple (Ri, ... , R s )- Then the uniform dis- 
tribution on the codewords induces a channel state 7 on 
(Xi---X s 2J)®": 

7= \ MA l ..\ M I E ® W^( mi ).../.(m.)- 

1 1 1 " l Vi: mtZMt i 

Its restriction to the w-th copy in this tensor power will 
be denoted j u . Let j £ [r], J C [a]: by Fano inequality 
(lemma ^|) we have 

77(X 8n (J)|2jf l X®"(J c )) < 1 + I ■ nR(J). 
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With 

H(X® n (J)\%lf n X® n (J c )) 

= iJ(X®"(J)) - /(£® n (J) A 2J®"X®"(J C )) 
= nR(J) - 7(X®"(J) A 2)f ™X®"(J C )) 



we conclude now 

(1 - e)R(J) <- + -lJX® n (J) A 2jf "X®"( J c )) 
n n J 

1 1 " 

using lemma 1 (subadditivity of mutual information). ■ 
Remark 4- For classical channels the region described in 
the theorem is the exact capacity region (i.e. all the rates 
there are achievable), as was first proved by Ahlswede M, 
^ . This fact is our reason to call it the weak converse, as 
it describes the best outer bounds of (n, e)-code rates for 
n — > oo and e — > 0. 

To prove that for multiple access channels (r = 1) this 
holds, too, is the object of the rest of the paper, though we 
conjecture it to be true in general. 

Remark 5: The numerical computation of the above re- 
gions is not yet possible from the given description: we 
need a bound on the number of different single-letter chan- 
nel states one has to consider in the convex combinations. 
For the multiple access channel (r = 1) this is easy: by 
Caratheodory's theorem s will suffice. For general r it is 
possible to show that r(2 s — 1) are sufficient (cf. ||). 

V. Measurement Error and Disturbance 

In this section a central result is proved that essentially 
states that if a POVM serves to indentify the states of 
an ensemble with high probability, then it may be imple- 
mented as an operation that disturbes the ensemble states 
very little. 

Lemma 6: Let p be a state, and X a positive operator 
with X < 1 and 1 - Tr (pX) < e < 1. Then 

||p- y/XpVXWt < VSc, 

with the trace norm ||A||i = Tr |X|. 

Proof: See @, lemma V.9. ■ 
Lemma 7 (Tender measurement) Let p a {a G ,4) be a 
set of states on 21, and D an observable indexed by B. Let 
further tp : A — > B be any map and e > such that 



VaEA 1 - Tr {p a D v{a) ) <e 



(1) 



(i.e. the observable recognizes <p(a) from p a with max- 
imal error probability e). Then the quantum operation 
S : 6(21) -» 6(21) defined by 



S : p i 



beB 



D bP ^D h 



disturbes the states p a only a little: 



The quantum operation A : 6(21) -> &(CB ® 21) with 
A : P l-> Y b ® V~^bP\f~Db 



beB 



has the property that 



VaeA y{a)®p a - A(p a )||i < V8e + e. 
Proof: It suffices to prove the second statement since 
the first inequality is obtained from it by a partial trace 
which does not increase || • II i: 



\<p(a) <8> p a - A(p )|| x 



< 



Pa 



- JD 



■ V {a)Pa^D v{a) 
+ W^DbPa^/D 



l 

bill 



< 



Pa - \l D v ( a )p a ^l D v ( a) 

^8(1-Tr(p a £> v(a) )) 
+ (l-Tv(p a D v{a) )) 
<V8e + e, 



using triangle inequality and lemma g. ■ 
Lemma 8 (Average version) With the same definitions 
as in lemma 0, only replacing equation (|l|) with 



Y P(a) (1 - Tr (p a D v(a) j) <e, 



(2) 



a£A 



for a probability distribution P on A, we have 
Y p (a)\\p a -S(pa)h < Vsi+e 



a£A 



and 



P(a)\\<p(a) ®Pa- A(p„)||i < V8E+ e. 



VaeA 



\\Pa-Hpa)\\i < V8~e + . 



a£A 

Proof: Again, we have only to prove the second state- 
ment. Introducing e a = 1 — Tr(p a D^( a )) we have, like in 
the previous proof, 

||<p(a) <8> p a - A(p a )|| 1 < \/8e^+ e a . 

Forming the average of the left hand side under the dis- 
tribution P, and using concavity of \A8a; + x the assertion 
follows. ■ 

VI. Quantum Multiple Access Channel: Coding 

Throughout this section we will assume r = 1 and else 
notation as before. 

Theorem 9: Let R\, . . . , R s be nonnegative, satisfying 
for some a priori distributions Pi on the Xi the constraints 

VJc[s] ^i?, < 7(X(J) A2J|X(J C )). 

ie.J 



Then for every e, 8 > and all sufficiently large n there are 
(n, e)-codes with rates ^ log \Mi\ > Ri — 5. 

Proof: It is sufficient by the time sharing principle 
to prove the assertion only for the upper extremal points 
of the region described, and by symmetry we may assume 
(for i = 1, . . . , s) that 

Ri = A 2)*! •••£*_!) 

= i/(2)|X 1 ---X l _ 1 )- J ff(2}|X 1 ---X l ) 

= H(P {>i _ 1} W\P { <^ 1} ) - H(P {>l} W\P { < l} ). 

That these are indeed the upper extreme points is proved 
in the appendix. 

The idea of the following construction is to first decode 
the message mi from sender 1, using only the incoming 
signal. Then decode the message m-2 from sender 2, using 
mi and the incoming signal (which is almost undisturbed 
by the tender measurement lemma). Iterate, until you de- 
code message m s from sender s, using mi, . . . , m s _i and 
the still almost unchanged incoming signal. 

Let e, S > 0, and consider s families of codewords Ci C 
XV- of size Li = \Ci\ = |"2 Tl ^- 5 )] , drawn independently 
from XJ 1 according to the a priori distribution P® n . 

Fix i for the moment and define the following channel: 
for x? £ XV- 



pW : x n 



W 

Px" 



L\ ■ ■ ■ Li-iL 



i+l 



E 



Icy ® W^. 



Vj/i: c"GCj \j<i 



(we denote these word states by pW, in contrast to the 
letter states W). Note that this is the channel belonging 
to the channel state 7 from the proof of theorem ||, reduced 
to (Xi---X l _i2))®". 

(i) 

The average of p x A over the choice of C\, . . . , Ci-i, 
Cj+i, . . . ,C S is indeed a product state: 

Vjj&i: x"£X? j<i 

= P{<i}( X l\j < i )<g) X ]®( P {>i}W) x :n„. x n 

Vj<i: x"£X™ ]<i 

= Vl 

X? ' 



with 

xi . . .Xi • 

Vj<i: XjZiXj j<i 

In M and p5| a construction of a decoding observable for 
the channel V and set Ci of codewords is described,^ which 
has the property that it's average error probability 

e v »(C js A) = l-4 T E Tr (K™ A.cj 1 )) 

1 c" G C, 

3 Observe that in this way will be independent from the other 
codes and their decoding observables! 



averaged over the choice of Ci, is at most e/s for large 
enough n: 

(e Vn {C u D.i)) c% <e/s 

(where we identified the set of messages with Ci). This is 
because I{Pi\ V) — I(Xj A 2}Xi ■ ■ • X;_i) = Ri. (Recall the 
approach of || and : a random code — drawn according 
to P® n — is chosen with rate slightly below I {Pi] V), and 
a decoding POVM constructed which forces the expected 
average error probability small. Then it is concluded that 
a code with this small error probability actually exists). 

We note that by the construction from j| and JTHJ it is 
assured that 



A 



for all c™ E Ci. It is easlity seen that we may assume 
this w.l.o.g., for wherever the Di comes from: the are 
density operators on some Hilber space H, such that 

V% e (Xi---X^i2))®" c £(W). 

Denoting this subset embedding by E, we have 

Tr(y c |A, c? ) = Tr(^(K|)A,c ? ) = Tr (V| £*(A,c ? )), 

with the adjoint map E* . I.e. the decoding error probabil- 
ities do not change if we replace the A,c™ by P*(A,c™) £ 

ix,---x, :iD 

Now we confront the decoder Di with the signal p, ob- 
taining an average error probability 



(C i7 A) = l-^ J! ^(PcfAc?)- 



cfed 



Obviously the code average over choice of Cj, j 7^ i is 
exactly eV™(Ci, A), so by the previous argument 



ei 



-«/Ci...C, 



< e. 



This means that there actually exist codebooks C\, . . . ,C S 
with all the Ei < e. Now consider the corresponding op- 
erations Aj of the observables Di, from lemma |?]. By the 
average tender measurement lemma || we find for each i: 



LvL s 



E 



< V8e-\ 



e. 



Their concatenation A = A s o • • • o Ai will be the decoder 
process: it satisfies 



1 



/•!••• /•- 



E 



.71 o 



A»...c») 



< s(v / 8e + e 



since each step introduces a deviation of at most v8e + e, 
and does not increase trace norm. 
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Now, discarding the signal itself (i.e tracing out 2)®") 



we arrive at A = Tr < 



o A, which satisfies 



Li- 



lc?-A(W?»... c „) 



< s(V8e + e). 



This means, that A is a quantum operation with classical 
outcomes in C\ x • • • x C s (hence described by a POVM 
D indexed by this set), which recovers rather accurately 
(c™, . . . , c") from W^n >i>c n, on average. ■ 

Now we may combine the (weak) converse theorem |^ 
and the foregoing direct coding theorem |^ (together with 
the time sharing principle by which the capacity region is 
convex) to obtain 

Theorem 10: The capacity region of the quantum multi- 
ple access channel is the convex closure of all nonnegative 
(Ri,... ,R S ) satisfying 

VJc[s] ]Ti?, </(£(J) A2)|X(J C )) 

for some input distribution and corresponding channel 
state. ■ 

VII. Quantum-Quantum Multiway Channels 

Allahverdyan and Saakian in S actually defined quan- 
tum multiway channels in more general form: namely al- 
lowing in the definition of section [0] general C*-algebras Xi 
(and not only commutative ones as we did). Each sender 
i may then use any set of states on Xi for transmission. 
Formally, let 



tp : &(Xi 



6(?)) 



be any completely positive, trace preserving, linear map, 
i.e. a quantum operation on states. A code for sender i 
now consists of a map Fi : A4i — > &(Xf n ), the decoders 
are still observables in the algebras 2)j. This allows for 
definition of error probabilities and rates, and hence of the 
capacity region, the details of which we leave to the reader. 

By composition we can view this as a classical-quantum 
channel on rt-blocks: 

(ffl 1 ,...,m,)H ^® Tl (Fi(mi) <g> • •• ® F s (m s )). 

If we now make the restriction that in coding the senders 
have to use product states, i.e. 



with Fi^ mu k £ &{Xi), we see immediately that the ar- 
gument from section IV applies, and gives a general up- 



per bound theorem (with the slight difference that now 
in the formation of the channel states 7„ instead of the 



states W 



/i(mi)„.../,(m,), 



there occur the output states 



<p{Fi ® ^,m,,«))' 

Conversely: those input states and a distribution on 
them being chosen we can apply the coding theorem ^| to 



see that these bounds can be approached — at least for the 
multiple access channel. 

We have thus obtained the capacity region in the 
1-separable case, i.e. for input states that are separable 
(it is easy to see that best performance in this case is for 
product states). Of course we could allow the sender to use 
blocks of states which entangle at most v systems, getting 
a possibly larger region R.M. This is because now the 
subadditivity of the mutual information, used in the upper 
bound in section IV, docs not hold any longer. We leave 
open the question of the ultimate capacity region R = 
(J„ R(") (which even for the case s = r = 1 is unsolved): 
whether it be equal or strictly larger than RW. 

VIII. Conclusion 

We introduced quantum multiway channel and bounded 
its capacity region. In the case of the quantum multiple ac- 
cess channel this led to a complete characterization of the 
capacity region, formally identical to the result in the clas- 
sical case. This confirms the validity of the principle that 
in the theory of classical information transmission over a 
quantum channel capacities are obtained from the classical 
formulas by replacing Shannon entropy by von Neumann 
entropy. 

The next step is of course to prove the direct coding the- 
orem in the case of several receivers. It seems that for this 
a refinement of the methods used here or even a different 
approach is needed: namely, it is known that it is not suf- 
ficient to achieve rate tuples of the form used in the proof 
of theorem ||. Hence the method of sequentially decoding 
the messages of the different senders has to be abandoned. 

A note on the literature: Since the findings of the 
present paper (published at that time only as an e-print 



( quant-ph/9807019| )) an article by Huang, Zhang, and 
Hou |10f| on quantum multiple access channels has ap- 
peared, where the 2-sender pure state case of a multiple 
access channel is treated. 

In that work a reference to the above statement was 
made concerning the wish for a more direct proof of the 
achievability of the region described in the coding theo- 
The authors propose to supply such a proof. How- 
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rem 

ever, they too employ the present paper's method to show 
achievability of the "corners" (compare theorem ^|), and 
then invoke the time-sharing principle. As was pointed 
out here this method is not adapted to prove the theorem 
for multiple receivers: it is still an open problem to find 
a proof that directly allows to achieve every point of the 
capacity region. We most probably know how to perform 
the random code selection, but the analog of the maximum 
likelihood decoder necessary to decode all messages simul- 
taneously still evades us (for detail how this strategy works 
in the classical case, see H). 
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Appendix 

Extremal points 

We shall prove: any upper extremal point of the region 
of non-negative tuples (Ri,... ,R S ) which satisfy for all 
JC[ S ] 

R(J) < I(X(J) AP(J C )), (J) 
for fixed channel state, is one of the ... ,R S ) with 

Rw(i) = I{%Tr(i) A ?)3-7r(l) ' ' ' ^Tr(i-l)) 

for some permutation ir of the set [s], and any such tuple 
belongs to this region. 

Let (Ri, . . . , R s ) be an upper extremal point, hence s of 
the inequalities (for nonempty subsets K\, . . . ,K S of [s]) 
are met with equality. We are done if we can prove that 
these can be chosen to form a strictly increasing chain ^ 
JCiCK 2 C...cA' s = [s], w.l.o.g. Ki = {s, s - 1, . . . , s - 
i + 1}, because then 

II, /i.V, ---.W • ^.l": •••A\ i 

- I(X l+1 ■ ■ ■ X s A 2}Xi • • • X % ) (3) 
= /(X l A2)Xi---X ! _i). 

Choose any one such set, say A ^ 0: 

R{K) = I(X(K) A 2)X(A C )). 

We would like to employ induction to extend this A in 
both directions to a chain. That we can do so follows from 

Lemma 11: If R\,. . . ,R S > are such that with ^ 
A C [s\: R{K) = I(X(K) A2)X(A C )) and (J) holds for all 
J C A and for all A C J C [s]. Then (J) holds for all 
Jc[s}. 

Proof: First let A C J C [s]. Then (J), together 
with (A) met with equality, implies 

R(J \ K) < A(2)|X(J C )) - A(2)|X(A C )) 

< A(2)|X((J\A) C ))-A(2)|X([ S ])) 
= /(X(J\A)A2)|X((J\A) c )), 

which is equation (J \ A). There the second estimate is 
by strong subadditivity applied to 2li = X(J \ K), 2t 2 = 
2)X(J c ),a 3 = X(A). 

Now let J C [s] be arbitrary and write J = J1UJ2 with 
J\ = J n A, J2 = J \ A. Then by assumption and the 
previous 

R(J) = A(Ji)+A(J 2 ) 

< A(2)|X(Jr))-A(2)|X([ S ])) 

+ A(2)|X( J 2 C n A c )) - A(2)|X(A C )) 
= A(2)X(jr)) + A(2)X(A C fl J 2 C )) - A(2)X(A C )) 
- A(X((A \ J!)U(A C \ J 2 ))) - A(2)|X([ S ])) 

< A(2}X(J! C n J 2 C )) - A(X(Jf fl J 2 C )) - A(2)|X([ S ])) 
= ff(2)|X(J 1 c n J 2 C ))-A(2)|X([ S ])) 

= I(X(J) A2)|X(J C )). 



There the second estimate is again by strong subadditivity, 
with 2li = X(J 2 ), 2l 2 = 2}X(A C n J|), 2l 3 = X(A \ Ji). ■ 

It follows that if (A') is met with equality for any A', 
also (A' fl A) and (A' U A) are met with equality. It is 
easily seen that both below and above A there must occur 
equalities (except already \K\ = 1 or A = [s]). 

It remains to show that (i?i, ... , R s ) defined by ([|) be- 
longs to the region. But this is clear by the same argument 
as above, used top-down: the conjunction of all (J) with 
J C [s] is implied by the conjunction of all with J C [s — 1] , 
this in turn inductively by all with J C [s — 2] etc. 
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